Search CORE

133 research outputs found

Word Recognition with Deep Conditional Random Fields

Author: Chen Gang
Li Yawei
Srihari Sargur N.
Publication venue
Publication date: 04/12/2016
Field of study

Recognition of handwritten words continues to be an important problem in document analysis and recognition. Existing approaches extract hand-engineered features from word images--which can perform poorly with new data sets. Recently, deep learning has attracted great attention because of the ability to learn features from raw data. Moreover they have yielded state-of-the-art results in classification tasks including character recognition and scene recognition. On the other hand, word recognition is a sequential problem where we need to model the correlation between characters. In this paper, we propose using deep Conditional Random Fields (deep CRFs) for word recognition. Basically, we combine CRFs with deep learning, in which deep features are learned and sequences are labeled in a unified framework. We pre-train the deep structure with stacked restricted Boltzmann machines (RBMs) for feature learning and optimize the entire network with an online learning algorithm. The proposed model was evaluated on two datasets, and seen to perform significantly better than competitive baseline models. The source code is available at https://github.com/ganggit/deepCRFs.Comment: 5 pages, published in ICIP 2016. arXiv admin note: substantial text overlap with arXiv:1412.339

arXiv.org e-Print Archive

Crossref

Joint Visual Denoising and Classification using Deep Learning

Author: Chen Gang
Li Yawei
Srihari Sargur N.
Publication venue
Publication date: 04/12/2016
Field of study

Visual restoration and recognition are traditionally addressed in pipeline fashion, i.e. denoising followed by classification. Instead, observing correlations between the two tasks, for example clearer image will lead to better categorization and vice visa, we propose a joint framework for visual restoration and recognition for handwritten images, inspired by advances in deep autoencoder and multi-modality learning. Our model is a 3-pathway deep architecture with a hidden-layer representation which is shared by multi-inputs and outputs, and each branch can be composed of a multi-layer deep model. Thus, visual restoration and classification can be unified using shared representation via non-linear mapping, and model parameters can be learnt via backpropagation. Using MNIST and USPS data corrupted with structured noise, the proposed framework performs at least 20\% better in classification than separate pipelines, as well as clearer recovered images. The noise model and the reproducible source code is available at {\url{https://github.com/ganggit/jointmodel}}.Comment: 5 pages, 7 figures, ICIP 201

arXiv.org e-Print Archive

Crossref

Computational Intelligence In Digital Forensics: Forensic Investigation And Applications

Author: Abraham Ajith
Azah Kamilah Muda
Srihari Sargur N.
Yun-Huoy C.
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2014
Field of study

The Series "Studies in Computational Intelligence" publishes new development and advances in the various areas of computational intelligence - quickly and with a high quality. The intent is to cover the theory, applications, and design methods of computational intelligence, as embedded in the fields of engineering, computer science, physics and life science, as well as the methodologies behind them. The series contains monographs, lecture notes and edited volumes in computational intelligence spanning the areas of neural networks, connectionist systems, genetic algorithms, evolutionary computation, artificial intelligence, cellular automata, self-organizing systems, soft computing, fuzzy systems, and hybrid intelligent systems. Of particular value to both the contributors and the readership are the short publication timeframe and the world-wide distribution, which enable both wide and rapid dissemination of research output

Universiti Teknikal Malaysia Melaka (UTeM) Repository

Integrating diverse knowledge sources in text recognition

Author: BOUCHARD D.C.
DEWEY G.
DOSTE
DUDA R.O.
HARI S.N.
HULL J.J.
Jonathan J. Hull
KNUT~ D.E.
MUNSON J.H.
MUTH F.E.
NEUHOFF D.L
Ramesh Choudhari
Sargur N. Srihari
SHX~CGHAL R.
SHZNCHAL R.
SRIHARI S.N.
TOUSSAINT G.T
~Y G.D.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref